Smart Paper Notes

SFF-DA: Sptialtemporal Feature Fusion for Detecting Anxiety Nonintrusively

Haimiao Mo , Yuchen Li , Shanlin Yang , Wei Zhang , Shuai Ding

Category: Computer Vision

2022-08-12

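Early detection of anxiety disorders is essential for reducing the suffering of people with mental disorders and improving treatment outcomes. Anxiety screening based on mHealth platforms is of particular practical value because it improves screening efficiency and lowers screening costs. In practice, differences among the mobile devices used in subjects' physical and psychological assessments, uneven data quality, and the small amount of data available in the real world make existing methods ineffective. We therefore propose a framework based on spatiotemporal feature fusion for detecting anxiety nonintrusively. To reduce the impact of uneven data quality, we build a feature extraction network based on "3DCNN+LSTM" and fuse the spatiotemporal features of facial behavior and non-contact physiology. In addition, we design a similarity assessment strategy to address the drop in model accuracy caused by small sample sizes. Our framework was validated on a real-world crew dataset and on two public datasets, UBFC-Phys and SWELL-KW. The experimental results show that the overall performance of our framework is better than that of the state-of-the-art comparison methods.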

translated by Google Translate

Incorporating Prior Knowledge into Reinforcement Learning for Soft Tissue Manipulation with Autonomous Grasping Point Selection

Xian He , Shuai Zhang , Shanlin Yang , Bo Ouyang

Category: Robotics

2022-07-21

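Previous studies of soft tissue manipulation assumed that the grasping point is known and that the target deformation can be achieved. They also assumed that the constraints stay constant during the operation and that there are no obstacles around the soft tissue. To go beyond these assumptions, a deep reinforcement learning framework with prior knowledge is proposed for soft tissue manipulation under unknown constraints, such as the force applied by fascia. The prior knowledge is represented by an intuitive manipulation strategy. As an action of the agent, a regulator factor is used to coordinate the intuitive approach and the deliberate network. A reward function is designed to balance exploration and exploitation for large deformations. Successful simulation results verify that the proposed framework can manipulate soft tissue while avoiding obstacles and accommodating newly added position constraints. Compared with the soft actor-critic (SAC) algorithm, the proposed framework accelerates training and improves generalization.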


Enhancing Space-time Video Super-resolution via Spatial-temporal Feature Interaction

Zijie Yue , Miaojing Shi , Shuai Ding , Shanlin Yang

Category: Computer Vision

2022-07-18

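The goal of space-time video super-resolution (STVSR) is to increase both the frame rate (also called the temporal resolution) and the spatial resolution of a given video. Recent approaches solve STVSR with end-to-end deep neural networks. A popular solution is to first increase the frame rate of the video, then perform feature refinement among the features of different frames, and finally increase the spatial resolution of these features. The temporal correlation among features of different frames is carefully exploited in this process. The spatial correlation among features of different (spatial) resolutions, however, has not been emphasized. In this paper, we propose a spatial-temporal feature interaction network that enhances STVSR by exploiting both spatial and temporal correlations among features of different frames and spatial resolutions. Specifically, a spatial-temporal frame interpolation module is introduced to interpolate low-resolution and high-resolution intermediate frame features simultaneously and interactively. Spatial-temporal local and global refinement modules are then deployed to exploit the spatial-temporal correlation among different features for their refinement. Finally, a novel motion consistency loss is employed to enhance the motion continuity among reconstructed frames. We conduct experiments on three standard benchmarks, Vid4, Vimeo-90K, and Adobe240, and the results show that our method improves on state-of-the-art methods by a considerable margin. Our code will be available at https://github.com/yuezijie/stinet-space-time-video-super-resolution.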
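
To make the input and output of the STVSR task concrete, here is a minimal Python sketch of a naive baseline: it doubles the frame rate by linearly blending neighboring frames and then enlarges every frame with nearest-neighbor upscaling. This is not the network proposed in the paper. The function names, the blending weight, and the toy video are assumptions chosen only for illustration.

```python
def interpolate_frame(frame_a, frame_b, t=0.5):
    """Blend two equally sized grayscale frames (lists of rows) at time t."""
    return [[(1 - t) * a + t * b for a, b in zip(row_a, row_b)]
            for row_a, row_b in zip(frame_a, frame_b)]


def upscale_nearest(frame, factor=2):
    """Enlarge a frame by repeating each pixel `factor` times in both axes."""
    out = []
    for row in frame:
        wide = [px for px in row for _ in range(factor)]
        out.extend([wide[:] for _ in range(factor)])
    return out


def naive_stvsr(frames, factor=2):
    """Insert one blended frame between neighbors, then upscale all frames."""
    dense = []
    for a, b in zip(frames, frames[1:]):
        dense.append(a)
        dense.append(interpolate_frame(a, b))
    dense.append(frames[-1])
    return [upscale_nearest(f, factor) for f in dense]


# Toy video: two 2x2 frames, one black and one white.
video = [[[0.0, 0.0], [0.0, 0.0]], [[1.0, 1.0], [1.0, 1.0]]]
result = naive_stvsr(video)
print(len(result), len(result[0]), len(result[0][0]))
```

A learned method replaces both steps with feature-level modules; the sketch only shows the shape change, from N frames to 2N - 1 frames, each enlarged by `factor`.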


Collaborative Three-Tier Architecture Non-contact Respiratory Rate Monitoring using Target Tracking and False Peaks Eliminating Algorithms

Haimiao Mo , Shuai Ding , Shanlin Yang , Athanasios V. Vasilakos , Xi Zheng

Category: Robotics | Machine Learning

2020-11-17

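Monitoring the respiratory rate is crucial for identifying respiratory disorders. Conventional respiratory monitoring devices are inconvenient and rarely available. Recent research has shown that non-contact technologies, such as photoplethysmography and infrared thermography, can collect respiratory signals from the face and monitor breathing. However, current non-contact respiratory monitoring techniques have poor accuracy because they are sensitive to environmental influences such as lighting and motion artifacts. Furthermore, in real-world medical application settings, frequent communication between users and the cloud can delay service requests and may lead to the loss of personal data. We propose a non-contact respiratory rate monitoring system with a collaborative three-tier design to increase the accuracy of respiratory monitoring and reduce data transmission latency. To reduce data transmission and network latency, our three-tier architecture decomposes the computing tasks of respiratory monitoring layer by layer. In addition, we improve monitoring accuracy by designing a target tracking algorithm and a false-peak elimination algorithm to extract high-quality respiratory signals. By collecting data and selecting several regions of interest on the face, we extract the respiratory signal and study how different regions affect respiratory monitoring. The experimental results show that extracting the respiratory signal from the nasal region performs best. Our approach outperforms competing approaches while transmitting less data.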
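
The abstract does not spell out how false peaks are eliminated, so the following Python sketch only illustrates the general idea under stated assumptions: peaks that are too low or too close to a taller neighbor are discarded before the respiratory rate is computed from the remaining peak spacing. The function names, the `min_gap` and `min_height` thresholds, and the synthetic signal are all hypothetical.

```python
import math


def find_peaks(signal):
    """Return the indices of local maxima in a list of samples."""
    return [i for i in range(1, len(signal) - 1)
            if signal[i - 1] < signal[i] >= signal[i + 1]]


def drop_false_peaks(signal, peaks, min_gap, min_height):
    """Keep peaks above min_height; within min_gap samples keep the taller."""
    kept = []
    for p in peaks:
        if signal[p] < min_height:
            continue  # too low to be a breath crest
        if kept and p - kept[-1] < min_gap:
            if signal[p] > signal[kept[-1]]:
                kept[-1] = p  # replace the previous, shorter peak
            continue
        kept.append(p)
    return kept


def breaths_per_minute(peaks, fs):
    """Estimate the rate from the mean spacing of peaks sampled at fs Hz."""
    if len(peaks) < 2:
        return 0.0
    mean_interval = (peaks[-1] - peaks[0]) / (len(peaks) - 1) / fs
    return 60.0 / mean_interval


# Synthetic 60 s signal at 10 Hz: 0.25 Hz breathing plus a 2 Hz ripple
# that stands in for motion or lighting artifacts.
fs = 10
signal = [math.sin(2 * math.pi * 0.25 * i / fs)
          + 0.2 * math.sin(2 * math.pi * 2.0 * i / fs) for i in range(600)]
raw = find_peaks(signal)
clean = drop_false_peaks(signal, raw, min_gap=20, min_height=0.5)
rate = breaths_per_minute(clean, fs)
print(len(raw), len(clean), rate)
```

In this toy signal the ripple produces a local maximum every few samples, so counting raw peaks would greatly overestimate the breathing rate; the filtered peaks fall once per 4-second breath.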


Identity-Aware Hand Mesh Estimation and Personalization from RGB Images

Deying Kong , Linguang Zhang , Liangjian Chen , Haoyu Ma , Xiangyi Yan , Shanlin Sun , Xingwei Liu , Kun Han , Xiaohui Xie

Category: Computer Vision

2022-09-22

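Reconstructing 3D hand meshes from monocular RGB images has attracted growing attention because of its large potential for applications in AR/VR. Most state-of-the-art methods tackle this task in an anonymous manner. Specifically, the identity of the subject is ignored, even though it is available in real applications where the user does not change during a continuous recording session. In this paper, we propose an identity-aware hand mesh estimation model that incorporates the identity information represented by the subject's intrinsic shape parameters. We demonstrate the importance of identity information by comparing the proposed identity-aware model with a baseline that treats the subject anonymously. Furthermore, to handle the use case in which the test subject is unseen, we propose a novel personalization pipeline that calibrates the intrinsic shape parameters using only a few unlabeled RGB images of the subject. Experiments on two large-scale public datasets validate the state-of-the-art performance of our proposed method.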


MIRNF: Medical Image Registration via Neural Fields

Shanlin Sun , Kun Han , Deying Kong , Chenyu You , Xiaohui Xie

Category: Computer Vision

2022-06-07

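Image registration is widely used in medical image analysis to provide spatial correspondences between two images. Learning-based methods that use convolutional neural networks (CNNs) have recently been proposed to solve image registration problems. Learning-based methods tend to be much faster than traditional optimization-based methods, but the accuracy gains obtained from complex CNN-based methods are modest. Here we introduce a new deep neural network-based image registration framework, named MIRNF, which represents the correspondence mapping as a continuous function implemented via neural fields. Given a 3D coordinate as input, MIRNF outputs either a deformation vector or a velocity vector. To ensure that the mapping is diffeomorphic, the velocity vector output by MIRNF is integrated with a Neural ODE solver to derive the correspondences between the two images. Furthermore, we propose a hybrid coordinate sampler together with a cascaded architecture to achieve high-similarity mapping performance and low-distortion deformation fields. We conduct experiments on two 3D MR brain scan datasets, which show that our proposed framework provides state-of-the-art registration performance while maintaining comparable optimization time.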
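
The step of integrating a velocity field to obtain a correspondence can be sketched in plain Python. The example below is an assumption-laden toy: a hand-written 2D rotation field stands in for the neural field, and a fixed-step Runge-Kutta (RK4) integrator stands in for the Neural ODE solver named in the abstract. The names `velocity` and `integrate` are hypothetical.

```python
import math


def velocity(point):
    """Toy stationary velocity field: rotation about the origin.

    In the paper this role is played by a neural field; the analytic
    form here is only a stand-in for illustration.
    """
    x, y = point
    omega = math.pi / 2  # a quarter turn over unit time
    return (-omega * y, omega * x)


def integrate(point, velocity_fn, steps=100, direction=1.0):
    """Integrate dx/dt = v(x) over unit time with fixed-step RK4."""
    h = direction / steps
    x, y = point
    for _ in range(steps):
        k1 = velocity_fn((x, y))
        k2 = velocity_fn((x + 0.5 * h * k1[0], y + 0.5 * h * k1[1]))
        k3 = velocity_fn((x + 0.5 * h * k2[0], y + 0.5 * h * k2[1]))
        k4 = velocity_fn((x + h * k3[0], y + h * k3[1]))
        x += h * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0]) / 6
        y += h * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1]) / 6
    return (x, y)


# Forward map of one point, then the inverse map by integrating backwards.
warped = integrate((1.0, 0.0), velocity)
restored = integrate(warped, velocity, direction=-1.0)
print(warped, restored)
```

Integrating the same field backwards recovers the starting point up to numerical error, which is the invertibility property that makes velocity-based parameterizations attractive for diffeomorphic registration.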


Diffeomorphic Image Registration with Neural Velocity Field

Kun Han , Shanlin Sun , Chenyu You , Hao Tang , Deying Kong , Xiangyi Yan , Xiaohui Xie

Category: Computer Vision

2022-02-25

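Diffeomorphic image registration is a crucial task in medical image analysis. Recent learning-based image registration methods use convolutional neural networks (CNNs) to learn the spatial transformation between image pairs and achieve fast inference. However, these methods usually require a large amount of training data to improve their generalization ability. At test time, learning-based methods may fail to provide a good registration result, most likely because the model has overfitted the training dataset. In this paper, we propose a neural representation of a continuous velocity field (NeVF) to describe the deformation between two images. Specifically, this neural velocity field assigns a velocity vector to each point in space, which gives it more flexibility in modeling complex deformation fields. Furthermore, we propose a simple sparse-sampling strategy to reduce the memory consumption of diffeomorphic registration. The proposed NeVF can also be combined with a pre-trained learning-based model, whose predicted deformation is taken as the initial state for optimization. Extensive experiments on two large-scale 3D MR brain scan datasets show that our proposed method outperforms state-of-the-art registration methods.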


Backdoor Attacks Against Dataset Distillation

Yugeng Liu , Zheng Li , Michael Backes , Yun Shen , Yang Zhang

Category: Machine Learning

2023-01-03

Dataset distillation has emerged as a prominent technique to improve data efficiency when training machine learning models. It encapsulates the knowledge from a large dataset into a smaller synthetic dataset. A model trained on this smaller distilled dataset can attain comparable performance to a model trained on the original training dataset. However, the existing dataset distillation techniques mainly aim at achieving the best trade-off between resource usage efficiency and model utility. The security risks stemming from them have not been explored. This study performs the first backdoor attack against the models trained on the data distilled by dataset distillation models in the image domain. Concretely, we inject triggers into the synthetic data during the distillation procedure rather than during the model training stage, where all previous attacks are performed. We propose two types of backdoor attacks, namely NAIVEATTACK and DOORPING. NAIVEATTACK simply adds triggers to the raw data at the initial distillation phase, while DOORPING iteratively updates the triggers during the entire distillation procedure. We conduct extensive evaluations on multiple datasets, architectures, and dataset distillation techniques. Empirical evaluation shows that NAIVEATTACK achieves decent attack success rate (ASR) scores in some cases, while DOORPING reaches higher ASR scores (close to 1.0) in all cases. Furthermore, we conduct a comprehensive ablation study to analyze the factors that may affect the attack performance. Finally, we evaluate multiple defense mechanisms against our backdoor attacks and show that our attacks can practically circumvent these defense mechanisms.
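
The attack success rate (ASR) reported above is the fraction of trigger-carrying inputs that a model assigns to the attacker's target class. The Python sketch below shows that metric on toy data. The abstract does not describe the trigger, so the bright corner patch, its size, the target label, and the hand-written `toy_backdoored_model` are all assumptions for illustration, not the paper's NAIVEATTACK or DOORPING procedure.

```python
def stamp_trigger(image, patch_size=2, value=1.0):
    """Return a copy of a grayscale image with a bright bottom-right patch."""
    stamped = [row[:] for row in image]
    height, width = len(stamped), len(stamped[0])
    for r in range(height - patch_size, height):
        for c in range(width - patch_size, width):
            stamped[r][c] = value
    return stamped


def attack_success_rate(predict, images, target_label):
    """Fraction of triggered images that the model maps to target_label."""
    triggered = [stamp_trigger(img) for img in images]
    hits = sum(1 for img in triggered if predict(img) == target_label)
    return hits / len(triggered)


def toy_backdoored_model(image):
    """Hypothetical classifier that outputs class 7 when the corner is bright."""
    return 7 if image[-1][-1] >= 1.0 else 0


# Five all-black 4x4 images stand in for clean test inputs.
clean_images = [[[0.0] * 4 for _ in range(4)] for _ in range(5)]
asr = attack_success_rate(toy_backdoored_model, clean_images, target_label=7)
clean_predictions = [toy_backdoored_model(img) for img in clean_images]
print(asr, clean_predictions)
```

A backdoor evaluation normally reports ASR alongside clean accuracy, since an attack is only stealthy if predictions on untouched inputs stay unchanged, as they do for the clean images here.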


PMT-IQA: Progressive Multi-task Learning for Blind Image Quality Assessment

Qingyi Pan , Ning Guo , Letu Qingge , Jingyi Zhang , Pei Yang

Category: Computer Vision

2023-01-03

Blind image quality assessment (BIQA) remains challenging because distortions are diverse and image content varies, which complicates distortion patterns across scales and makes the BIQA regression problem harder. However, existing BIQA methods often fail to consider multi-scale distortion patterns and image content, and little research has addressed learning strategies that help the regression model perform better. In this paper, we propose a simple yet effective Progressive Multi-Task Image Quality Assessment (PMT-IQA) model, which contains a multi-scale feature extraction module (MS) and a progressive multi-task learning module (PMT), to help the model learn complex distortion patterns and to optimize the regression task in a way that follows the easy-to-hard progression of human learning. To verify the effectiveness of the proposed PMT-IQA model, we conduct experiments on four widely used public datasets. The experimental results indicate that PMT-IQA outperforms the comparison approaches and that both the MS and PMT modules improve the model's performance.


MGTAB: A Multi-Relational Graph-Based Twitter Account Detection Benchmark

Shuhao Shi , Kai Qiao , Jian Chen , Shuai Yang , Jie Yang , Baojie Song , Linyuan Wang , Bin Yan

Category: Computer Vision

2023-01-03

The development of social media user stance detection and bot detection methods relies heavily on large-scale, high-quality benchmarks. However, in addition to low annotation quality, existing benchmarks generally have incomplete user relationships, which holds back graph-based account detection research. To address these issues, we propose a Multi-Relational Graph-Based Twitter Account Detection Benchmark (MGTAB), the first standardized graph-based benchmark for account detection. To our knowledge, MGTAB was built from the largest original data in the field, with over 1.55 million users and 130 million tweets. MGTAB contains 10,199 expert-annotated users and 7 types of relationships, ensuring high-quality annotation and diversified relations. In MGTAB, we extracted the 20 user property features with the greatest information gain, together with user tweet features, as the user features. In addition, we performed a thorough evaluation of MGTAB and other public datasets. Our experiments found that graph-based approaches are generally more effective than feature-based approaches and perform better when multiple relations are introduced. By analyzing the experimental results, we identify effective approaches for account detection and suggest potential future research directions in this field. Our benchmark and standardized evaluation procedures are freely available at: https://github.com/GraphDetec/MGTAB.
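
Selecting the features with the greatest information gain can be illustrated with a short Python sketch. It computes the standard entropy-based information gain for discrete features and ranks them. The feature names, toy values, and labels below are hypothetical and are not taken from MGTAB; continuous properties would first need to be binned, which this sketch does not do.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())


def information_gain(feature_values, labels):
    """Entropy of the labels minus their entropy conditioned on the feature."""
    total = len(labels)
    groups = {}
    for value, label in zip(feature_values, labels):
        groups.setdefault(value, []).append(label)
    conditional = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - conditional


def top_k_features(table, labels, k):
    """Return the names of the k features with the highest information gain."""
    gains = {name: information_gain(col, labels) for name, col in table.items()}
    return sorted(gains, key=gains.get, reverse=True)[:k]


# Hypothetical toy data: four accounts, three binary property features.
labels = ["bot", "bot", "human", "human"]
table = {
    "default_avatar": [1, 1, 0, 0],  # separates the classes perfectly
    "verified": [0, 1, 0, 1],        # carries no information here
    "has_url": [1, 1, 1, 0],         # partially informative
}
selected = top_k_features(table, labels, k=2)
print(selected)
```

With these toy values the ranking is `default_avatar`, then `has_url`, then `verified`, so asking for the top two drops the uninformative feature.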
